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Abstract 

For connectivity of random geometric graphs, where there is no 
density for underlying distribution of the vertices, we consider n i.i.d. 
Cantor distributed points on [0, 1]. We show that for this random geo- 
metric graph, the connectivity threshold -/?„, converges almost surely to 
a constant 1—20 where < (/> < 1/2, which for the standard Cantor dis- 
tribution is 1/3. We also show that ||i?„ - (1 - 2./)) ||^ -2C(0) n~^/'^* 
where C (0) > is a constant and :— — log2/log(/i is the Hausdorff 
dimension of the generalized Cantor set with parameter (f). 
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1 Introduction 



1.1 Background and motivation 

A random geometric graph consists of a set of vertices, distributed randomly 
over some metric space, in which two distinct such vertices are joined by an 
edge, if the distance between them is sufficiently small. More precisely, let 

be a set of n points in W^, distributed independently according to some 
distribution F on M"^. Let r be a fixed positive real number. Then, the 
random geometric graph G = GiVn,r) is a graph with vertex set Vn where 
two vertices v = (wi, . . . , Vd) and u = (ui, . . . , Ud) in Vn are adjacent if and 
only if II V — u|| < r where ||.|| is some norm on W^. 

A considerable amount of work has been done on the connectivity thresh- 
old defined as 



-Rn = inf i r > 



G(Vn,r) is connected I . (1) 



The case when the vertices are assumed to be uniformly distributed in 
[0,1]'^, [1] showed that with probability one 

n^oo logn { ior d>2 ^ ^ 

when the norm ||.|| is taken to be the Coo or the sup norm. Later [2] showed 
that the limit in ([2]) holds but with different constants for any £p norm for 
1 < p < oo. [3] considered the case when the distribution F has a continuous 
density / with respect to the Lebesgue measure which remains bounded away 
from on the support of F. Under certain technical assumptions such as 
smooth boundary for the support, he showed that with probability one, 

lim Rt = C 

n^oo log n 

where C is an explicit constant which depends on the dimension d and es- 
sential infimum of / and its value on the boundary of the support. Recently, 
[1] [personal communication], studied a case when the density / of the un- 
derlying distribution may have minimum zero. They in particular, proved 
that when the support of / is [0, 1] and / is bounded below on any compact 
subset not containing the origin but it is regularly varying at the origin, 
then Rn/F~^{l/n) has a weak limit. 

The proof by [3] can easily be generalized to the case where the density is 
zero at finitely many points. A question then naturally arises what happens 
to the case when the distribution function is flat on some intervals, that is, 
if density exists then it will be zero on some intervals. Also what happens 
in the somewhat extreme case, when the density may not exist even though 
the distribution function is continuous and has flat parts. To consider these 
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questions, in this paper we study the connectivity of random geometric 
graphs where the underlying distribution of the vertices has no mass and is 
also singular with respect to the Lebesgue measure, that is, it has no density. 
For that, we consider the generalized Cantor distribution with parameter (p 
denoted by Cantor{(j)) as the underlying distribution of the vertices of the 
graph. The distribution function is then flat on infinitely many intervals. 
We will show that the connectivity threshold converges almost surely to 
the length of the longest flat part of the distribution function and we also 
provide some finer asymptotic of the same. 



1.2 Preliminaries 

In this subsection, we discuss the Cantor set and Cantor distribution which 
is defined on it. 



1.2.1 Cantor Set 

The Cantor set was first discovered by [5j but became popular after [6]. 
The Standard Cantor set is constructed on the interval [0, 1] as follows. 
One successively removes the open middle third of each subinterval of the 
previous set. More precisely, starting with Co := [0, 1], we inductively define 



U 

fc=l 



^n.k ^n.k 



u 



^n.k 



-'n,k 



^n,k 



-'n,k 



The Standard Cantor set is then defined as 



where C„ := U [a„,fc,&n,fc] 
k=l 

°° Cn- It is known that the Hausdorff dimension of the standard 



m=o 



c = n 

Cantor set is (see Theorem 2.1 of Chapter 7 of [7]). 

For constructing the generalized Cantor set, we start with unit interval 
[0, 1] and at first stage, we delete the interval ((/>, 1 — (j)) where < < 1/2. 
Then, this procedure is reiterated with the two segments [0, (j)] and [1 — (/>, 1]. 
We continue ad infinitum. The Hausdorff dimension of this set is given by 

:= — 1^1^ (see Exercise 8 of Chapter 7 of [7J). Note that the standard 
Cantor set is a special case when (j) = 1/3. 



1.2.2 Cantor distribution 

The Cantor distribution with parameter </> where < < 1/2 is the distri- 
bution of a random variable X defined by 

oo 

X = ^0-iZ, (3) 

i=l 

where Zi are i.i.d. with P[Zi = 0] = P[Zi = 1 - 0] = 1/2. If a random 
variable X admits a representation of the form ([3]) then we will say that 
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X has a Cantor distribution with parameter (j), and write X ~ Cantor (0). 
Observe that Cantor (0) is self-similar, in the sense that, 

Y (i j with probability 1/2 , , 

~ \ (pX + with probability 1/2 ^ ' 

This follows easily by conditioning on Zi. It is worth noting here that if 
X ~ Cantor((/)) then so does 1 — X. 

Note that for (p = 1/3 we obtain the standard Cantor distribution. 



2 Main results 

Let Xi,X2, ■■■,Xn be independent and identically distributed random vari- 
ables with Cantor((^) distribution on [0,1]. Given the graph Q = Q(yn,r), 
where Vn = {Xi,X2, ■ ■ ■ , Xn}, let i?„ be defined as in ([T]). 

Theorem 1. For any < (p < 1/2, as n — > oo we have 

Rn — > 1-2(1) a.s. (5) 

Our next theorem gives finer asymptotics but before we state the theo- 
rem, we provide here some basic notations and facts. Let m„ :=min{Xi, X2, . . . , Xn\. 
Using dH) we get 

( (j)mk with probability 2~"(^) for k = l,2,...,n , . 

" 1^ (j)mn + 1 — (j) with probability 2~" 

Let an := E[m„]. Using ^ [8j derived the following recursion formula for 
the sequence (a„) 

n-l 

(2"-20)a„ = l-(/. + </.VC;)afe, n>l (7) 



Moreover [9] showed that whenever < < 1/2 then as n — )• 00, 



a. 



n 



(</)), (8) 

where 



(plog 2 

and (i^ = ~|^|^ is the Hausdorff dimension of the generalized Cantor set. 
Here r(-) and C(-) are the Gamma and Riemann zeta functions, respectively. 

Our next theorem gives the rate convergence of Rn to (1 — 2<j)) in terms 
of the Ci norm. 

Theorem 2. For any < (j) < 1/2, as n — > 00 we have 

"'^""(_^^''^^"^ ^2C (10) 

where C {(p) is as in equation ^ and \\-\\-^ is the L\ norm. 
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3 Proof of the theorems 



3.1 Proof of Theorem [T] 

We draw a sample of size n from Cantor {(p) on [0, 1]. Let A'^^^ be the number 
of elements falling in the subinterval [0, (p] and n — Nn in [1 — c^, 1]. From 
the construction Nn ~ Bin In selecting this sample of size n, there 

are three cases which may happen. Some of these points may fall in interval 
[0,(p] and rest in interval [1 — i;^, 1]. That means Nn ^ {0,n}. In this case 
the distance between the points in [0, (p] and [1 — (j), 1] is at least 1 — 2(f). The 
other cases are when all points fall in [0, (p] or all fall in [1 — (/>, 1] , which 
in this case Nn = n or Nn = 0. Let m„ = min Xi, Mn = max Xi and we 

l<j<ri l<i<n 

define 

Ln ■■= max{Xi| I <i <n and Xi £ [0,0]} (11) 

and 

Un := inm{Xi\ 1 < i < n and Xi e [1 - (p, 1]} . (12) 

We will take L„ = (and similarly C/„ = 0) if the corresponding set is empty. 

Now find a K = K{(p) such that (p^ < ^ {1 - (p) {1 - 2(p) . Note that 
such a K < oo exists since < < 1. Let Ii,l2, ■ ■ ■ ,I2k be the 2^ sub- 
intervals of length (p^ which are part of the stage of the "removal of 
middle interval" for obtaining the generalized Cantor set with parameter 
(p. For 1 < j < 2^ define Nj := X^"^^ 1 {Xi G Ij), which is the number 
of sample points in the sub-interval Ij. From the construction of the the 
generalized Cantor distribution with parameter cp it follows that 

ATi^ := (A^i,iV2,...,iV2ic) -Multinomial (n; (-^,-^,--- ,-^)) , (13) 



that 



2K' 2^' '2^ 

E2''- 
Nj. Consider the event En ■= H [Nj > 1]. Observe 

on the event En the maximum inter point distance between two points in 
[0, (p] as well as in [1 - (p, 1] is at most 2(p^ + (1 - 20) < 1 - 20 by the 
choice of K. Thus on En we must have i?„ = Un — Ln and so we can write 

Rn = (Un - Ln) 1e„ + K 1e^„ (14) 

where i?* is a random variable such that < i?* < (p a.s. 

Observe that conditioned on [A''i = ri, N2 = r2, . . . , N2k = ] we have 

Un = ^ — (p + (pmn-k and Ln = 0Mfc and Nn = k where k = rj. More 

ijC[o,4>] 

generally 

{{Ln, Un) , NK)n>l = {{<PMn„,1 - <P + Cpm 
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Note that to be technically correct we define Mq = mo = 0. 

Now it is easy to see that rUn — > and M„ — > 1 a.s. But by the SLLN, 
Nn/n — >■ 1/2 a.s., thus both {Nn) and (n — iV„) are two subsequences which 
are converging to infinity a.s. Moreover 

/ 1 \" 

P(£;;^)<^P(iV,=0) = 2^ 1--^ =2^exp(-a^n) , (16) 

where aK = - log - ^) > 0. Thus X;r=i i^n) < oo, so by the First 
Borel-Cantelli Lemma we have 

P {E^ infinitely often ) = F {En eventually ) = 1 . 

In other words 1e„ — > 1 a.s. and — > a.s. Finally observing that 
< Rn < we get from equations (fH|) and (fT5]l 

i?„ ^ (1 - 20) . 

□ 



3.2 Proof of Theorem [2] 

We start by observing 

E[\Rn-{l-2<P)\] 
= E [{Rn - (1 - 24>)) IeJ + E [IK - (1 - 20)1 Iec] 

= E[{Un -Ln-{1- 20)) lE^lK<N^<n-K]+E[\Rl " (1 " 20)| lijc] 
= E [{Un - Ln- {1 - 20)) l/^<Ar„<n-A'] 

-E [{Un - L„ - (1 - 20)) l£;cl;^<;v„<n-K] + E [K " (1 " 20)| liJc] 
= E [([/„ -Ln-{1- 20)) ll<JV„<n-l] 

-E [([/„ - L„ - (1 - 20)) li=;eli<iv„<n-i] +1E [IK - (1 - 20) | l^c] . 

(17) 

In above the first equality holds because of (|14p and the fact that on the 
event En we must have Rn > 1 — 20. Second, third and forth equalities 
follows from the simple fact that En [K < Nn < n — K]. 

Now recall that a„ = E[m„] so for the first part of the right-hand side 
of the equation (|17p we can write 

E [{Un -Ln-{1- 20)) ll<^„<„_l] 

= -A^[(2"-20)a„-(l-0)] , (18) 
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where the last equahty follows from ([Tj). The other two parts of the right- 
hand side of the equation pT|) are bounded in absolute value by 

F{E^) < 2^exp(-ai^n) 

because of (|15|) . Now observe that from equation ([S]) we get that a„ ~ 
i_ 

C {</)) n '^'^ where = "j^f^ is the Hausdorff dimension of the generalized 
Cantor set. Thus using pzp and (fTBjl we conclude that 



This completes the proof using ([5]). □ 



4 Final Remarks 

It is worth noting here that our proofs depend on the recursive nature of 
the generalized Cantor ditsribution (see equation dH). Thus unfortunately, 
they do not have obvious extensions for other singular distributions. It 
will be interesting to derive a version of Theorem [T] for a general singular 
distribution with no mass and flat parts. Intuitively it seems that the final 
limit should be the length of the longest flat part. It will be more interesting 
if Theorem [2] can also be generalized for general singular distributions with 
no mass and flat parts where (1 — 20) is replaced by the length of the longest 
flat part and d^f, is replaced by the Hausdorff dimension of the support. 
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